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Wheat Lodging Ratio Detection Based on UAS Imagery 
Coupled with Different Machine Learning and Deep 
Learning Algorithms 


Paulo FLORES, ZHANG Zhao’ 


(Department of Agricultural and Biosystems Engineering, North Dakota State University, Fargo, ND 58102, USA) 


Abstract: Wheat lodging is a negative factor affecting yield production. Obtaining timely and accurate wheat lodging 
information is critical. Using unmanned aerial systems (UASs) images for wheat lodging detection is a relatively 
new approach, in which researchers usually apply a manual method for dataset generation consisting of plot images. 
Considering the manual method being inefficient, inaccurate, and subjective, this study developed a new image pro- 
cessing-based approach for automatically generating individual field plot datasets. Images from wheat field trials at 
three flight heights (15, 46, and 91 m) were collected and analyzed using machine learning (support vector machine, 
random forest, and K nearest neighbors) and deep learning (ResNet101, GoogLeNet, and VGG16) algorithms to test 
their performances on detecting levels of wheat lodging percentages: non- (0%), light (<50%), and severe (>50%) 
lodging. The results indicated that the images collected at 91 m (2.5 cm/pixel) flight height could yield a similar, 
even slightly higher, detection accuracy over the images collected at 46 m (1.2 cm/pixel) and 15 m (0.4 cm/pixel) 
UAS mission heights. Comparison of random forest and ResNet101 model results showed that ResNet101 resulted 
in more satisfactory performance (75% accuracy) with higher accuracy over random forest (71% accuracy). Thus, 
ResNet101 is a suitable model for wheat lodging ratio detection. This study recommends that UASs images collect- 
ed at the height of about 91 m (2.5 cm/pixel resolution) coupled with ResNet101 model is a useful and efficient ap- 
proach for wheat lodging ratio detection. 
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1 Introduction 


In addition to being as a major source of pro- 
tein and energy"), wheat (Triticum aestivum L.) pro- 
vides a number of essential elements for human 
health, such as iron, vitamin B, and fiber”. The 
global wheat production has been increasing gradu- 
ally, following the footsteps of technology develop- 
ment, and in 2020 a new historical record was estab- 
lished with the production of 7.7x10" kg". Howev- 
er, wheat production still faces a myriad of challeng- 
es threatening crop yield, such as rust and tan spot 


B drought stress'!, and crop lodg- 


diseases", pests 
ing’, among which the wheat lodging issue ranking 
the top”. 

Wheat lodging, the permanent displacement of 
plant stems from the original and natural upright po- 
sition, can be caused by inclement weather condi- 
tions (e. g., rain and wind), over-density planting 
population, and over application of fertilizer”. 
Lodging can lead to yield losses of up to 60%—80%, 
due to lowered photosynthesis rate of the wheat 
plants!"”. The lodging can deteriorate grain quality 
(e. g., less protein) and worsen grain losses during 
harvest, because the combine harvester has difficul- 
ties to pull the lodged crops into the header"''”. 
Lodging has been a topic of interest for researchers, 
agronomists, farmers, and even crop insurance com- 
panies for decades. Wheat plant breeders are inter- 
ested in lodging information for the purpose of se- 
lecting lines/cultivars that are more resistant to lodg- 
ing; plant science researchers and agronomists col- 
lect lodging information so that they can identify its 
causes; farmers can use lodging rates to choose the 
better variety fitted to environmental conditions; 
crop insurance companies would take advantage of 
the information to calculate the insurance coverage 
for farmers". 


In recent years, there has been an increase in 


adoption on newer technologies in agriculture, espe- 


cially on specialty" and row"! 


crops. However, 
when it comes to wheat lodging detection and its as- 
sociated metrics, the mostly used approach still ful- 
ly relies on individual walking the field in-person 
and visually assessing lodging conditions™™?", 
There are many disadvantages associated with this 
approach. First, it is infeasible for one to reach parts 
of a lodged wheat field under certain conditions, 
such as immediately after heavy rain, when one 
might have to wait for a few days before the fields 
are accessible. Second, the evaluation process is in- 
efficient, as the evaluator has to move across the 
field, which can take time. Third, the results from 
different individuals may vary greatly, as each one 
might have different standards (subjectively). In ad- 
dition, the results for the same person on the same 
plot/field of multiple evaluations may be differ- 
ent”. Hence, there is a need for the development of 
an efficient, objective, and low-cost approach for 
wheat lodging detection. 

Researchers explored a number of approaches 
for wheat lodging detection, starting with taking ad- 
vantage of satellite images”, Though the perfor- 
mance of satellite images on crop lodging has been 
validated®*", its wide adoption as a universal ap- 
proach is challenging due to its low spatial (finest 
as tens of centimeters) and coarse temporal (multi- 
ple days) resolution. Since the satellite images can- 
not meet the practical application requirements, re- 
searchers started to explore other potential technolo- 
gies. Most recently, researchers started to explore 
the potential of using unmanned aerial system 
(UAS) images for crop lodging detection given the 
recent development on UASs, sensors, and associat- 
ed software for data processing, coupled with new 
and effective machine learning (ML) and deep 


learning (DL) algorithms””. Compared to the satel- 
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lite images, the UASs images have several advan- 
tages. First, the resolution of the collected UAS aeri- 
al images is much finer than those by satellite, with 
some of them reaching millimeter level. In addition, 
UASs allow for a shorter revisit time of the same ar- 
ea (temporal resolution) compared to satellite, given 
weather conditions are suitable for flights, allowing 
even for more than one flight a day over the same 
area of interest. During the past few years, there has 
been a sharply increase on research exploring the 
use of UASs for crop lodging—though it is at a na- 
scent stage, study results have demonstrated the po- 
tential on different crops, such as sugar beet, cano- 
la, and wheat”! So far, a majority of the studies 
treat crop lodging as a binary issue—lodging or 
non-lodging. From a plant breeder perspective, 
though the lodging and non-lodging information is 
crucial, it would be more meaningful to know the 
lodging as a percentage of each experimental unit 
(plot), which can be a key piece of information 
when making breeding decisions. However, few 
studies have been conducted for this specific purpose. 

Plot lodging ratio detection starts with the data- 
set (individual plot images) generation. Currently, 
the main approach for dataset generation is manual 
—researchers crop the plot images one by one!” 
Though the manual approach has been widely used, 
it has a number of shortcomings, such as inefficien- 
cy and inaccuracy. Since there is no free automatic 
approach for plot dataset generation, researchers 
have no other choices but to take advantage of the 
manual method to collect data. Considering that 
breeders usually have field trials with hundreds or 
even thousands of plots for field trials, there is a 
need for an automatic approach for the dataset gen- 
eration to replace the manual process. 

After generating the proper dataset, the next 


step is to extract features from individual image, 


and then feed them into a classifier (model). The 
proper selection of features to extract is a crucial 
step—if selected features cannot represent the im- 
age characteristics satisfactorily, it would generate a 
low model accuracy; only when the selected fea- 
tures represent the image characteristics well, it 
would obtain a good model accuracy. The selection 
of features mainly relies on domain knowledge, and 
researchers usually take advantage of accumulated 
experience for feature selection. Mardanisamani et 
al." and Chauhan et al.’ demonstrated that textur- 
al and color features have performed well on crop 
lodging detection. With the advent of DL, research- 
ers start to use the DL models to address the classifi- 
cation issues. Very recently, there has been reports 
of a number of DL models with satisfactory perfor- 
mance on solving agriculture-related issues”. How- 
ever, only few studies have compared the wheat 
lodging detection accuracies between DL and ML. 

UAS mission height is a crucial factor for the 
quality of the collected images and the data collec- 
tion time. Considering the same area of interest, 
UASs flights carried out at higher altitude (e.g., 90 m 
above ground level) would result in shorter flight 
time and imagery with lower resolution, compared 
to flights carried out at a lower altitude (e.g., 15 m 
above ground level). Compared to the low resolu- 
tion images, processing high resolution ones re- 
quires much more time and more hard drive space 
for storage. Though the mission height is crucial for 
crop lodging monitoring, the current method for de- 
termining the mission height is mainly empirically 
based. Therefore, there is a need to explore the ef- 
fects of different UAS flight heights on crop lodg- 
ing detection accuracy and to determine proper 
height to collect images for such evaluations. 

This study addressed the main issues associat- 


ed with the predominant approach to determine the 
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occurrence of lodging in wheat research trials. The 
objectives of this study were to: (1) develop an auto- 
matic approach for wheat lodging plot dataset gener- 
ation from UAS imagery; (2) explore the mission 
height effects on wheat lodging ratio detection accu- 
racy, and (3) compare the performance of ML and 


DL on wheat lodging ratio detection. 
2 Materials and methods 


The procedures used in this study, including 
developing and testing an automatic wheat plot data- 
set generation method, using and comparing DL and 
ML models for lodging ratio detection, and quanti- 
fying the height effects on wheat lodging detection 


accuracy, are schematically outlined on Fig. 1. 


UAS color images 
Three heights 
Image pre-processing 
Stitching and geo-reference 
Generate plot datasets 
Automatic approach 
Modeling using different ML and DL algorithms 
SVM, RF, and KNN; ResNetl 01, GoogLeNet and VGG16 


Study recommendation 
Model + mission height with higher detection accuracy 


Field visual grading on plot 
lodging ratio 
Non-, light, and severe lodging 


Note: Non-, light, and severe lodging represent 0%, <50%, and > 
50% crop lodged, respectively; SVM—support vector machine; RF— 
random forest; KNN—K nearest neighbors; three heights are 15, 46, 


and 91 m 
Fig. 1 Workflow of wheat plot lodging ratio detection under 


different mission heights for unmanned aerial systems 


After lodging occurrence, researchers used a 
UAS to capture images from the research area at dif- 
ferent heights. Following the flights, the lodging oc- 
currence in each plot was visually graded into one 
of the three categories: non- (0% crop lodged), light 
(<50% crop lodged), and severe lodging (>50% 
crop lodged). For each flight height, images were 
first processed to obtain an orthomosaic map for the 
area of interest, which was then fed into the newly 


develop plot image dataset generation method for 


dataset generation. Then, different ML and DL mod- 
els were tested, and their performance was com- 
pared. All the procedures used in this study are de- 


scribed in details in the subsequent sections. 
2.1 Experimental field and data collection 


A field rented from a local farmer near Thomp- 
son, ND, U.S. was used to plant the experimental 
trials during the 2020 growing season (May-Au- 
gust, 2021). The location and experimental fields 
are shown in Fig. 2. There are two size types with 
different sizes on this location: long plot with di- 
mensions of 1.5 mx14.6 m (5 ftx48 ft) and short 
plot with dimensions of 1.5 mx3.6 m (5 ftx12 ft). A 
total of 428 plots were planted with 116 long plots 
and 312 short plots, among which 28 plots were 
used as borders (columns 1, 11, 20, 29, and 55 in 
the right images of Fig. 2; columns 1 and 55 are the 
left- and right-most columns, respectively). While 
generating dataset consisting of individual plot im- 
ages, all these border plots need to be manually re- 
moved. 

Approximately a week after the lodging oc- 
curred, UAS flights were conducted at three heights 
(15, 46, and 91 m) on July 23, 2020. The 91 m is 
the highest above ground level allowed for a UAV 
mission by the U.S. Department of Transportation; 
the 15 m is the empirically determined lowest 
above ground level for a UAV mission based on our 
multiple year experience; and the 46 m is roughly 
half height of the highest above ground level. An 
off-the-shelf DJI Phantom 4D RTK UAS (DJI-Inno- 
vations, Inc., Shenzhen, China) was used for data 
collection. The UAS is outfitted with a 20 megapix- 
el (5472 x 3648 pixel) color (RGB) camera. The 
mission speed was set to 2.5 m/s, and since the 
weather was sunny during the entire data collection 


time window, the balance mode was set as sunny. 
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Thompson, ND 
I 


Note: On the right image, columns 1 (left most), 11, 20, 29, and 55 (right most) are border plots 
Fig. 2 The location of field trials (left) and field trials layout (right) 


Additionally, both side and forward overlaps for im- 
age capture were set as 80%. 

Immediately after all the three flights, two 
evaluators (trained by an experienced plant scien- 
tist) visited the field for visual grading of the lodg- 
ing conditions. For each plot, the evaluators graded 
the lodging level in one of the following three cate- 
gories: non-lodging (lodging event does not occur), 
light lodging (<50% of the plot area lodged), and se- 
vere lodging (>50% of the plot area lodged). This 
study applied the value of 50% as the threshold to 
distinguish light and severe lodging, and different 
breeders may use other threshold values (e. g., 
40%). Thus the performance of models on other 
threshold values may need further test. To avoid in- 
fluence by each other, the two individuals worked 
independently. After collecting the ground truth da- 
ta, the two evaluators and the person who trained 
them compared the individual grades for each plot. 
If the two grades were the same, it remained as it 
was; if the two grades were different, the three indi- 
viduals would vote using the orthomosaic imagery 
(refer to below section) and decided the final grade 
for the plot. At the end of the process, there was on- 


ly one grade for each plot. 


2.2 Data pre-processing and dataset 
generation 


After each mission (three in total), all the col- 
lected images were processed with Pixel4D mapper 
by Pix4D (Pix4D V4.3.33, S. A., Prilly, Switzer- 
land) to generate an orthomosaic of the area (Fig. 2 
right image). To assure alignment of the orthomosa- 
ics across all three flights, eight ground control 
points (installed soon after the experiment was 
planted) were used during the image stitching pro- 
cess in Pix4Dmapper. Ground control points were 
surveyed with a Trimble Geo7X GPS unit, which 
was connected to a virtual base network (VBN), 
providing a 2 cm level accuracy. 

Fig. 3 shows the automatic dataset generation 
process. After the orthomosaics were created, they 
were processed to generate datasets consisting of in- 
dividual plot images. For each pixel, its excess 
green value (Equation 1) was calculated (Fig. 3(b)). 
Then, thresholding (0.1) was conducted on the ex- 
cess green image to generate a binary image— 
black pixel equals to 0 and white pixel equals to 1 
(Fig. 3(c)). 

Excess Green = 2 x Green(G) - Red(R) - 

Blue(B) (1) 
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Note: (a) original image; (b) monochrome of excess green; (c) binary image after thresholding (0.01) excess green image (b); (d) relationship 


between x coordinates (horizontal direction) and sum of each column; (e) relationship between y coordinates (vertical direction) and the sum of 


each row; and (f) samples of automatically generated extracted images of the individual plots 


Fig. 3 General procedure on the development of automatic plot image generation 


The experimental field mainly consists of four 
large blocks (Fig. 3(a)) in the vertical direction, and 
since the four blocks are similar, the approach de- 
veloped for one of the blocks can be used on the 
other three. The top block was selected to explain 
the developed methodology. For individual plot, the 
key parameter in the vertical direction is the top and 


bottom coordinates for both large and small plots. 


For the large plots, the top and bottom lines are R1 
and R6; for the small plots, the boundaries are R1 
and R2, or R3 and R4, or RS and R6 (Fig. 3(c)). For 
the horizontal direction, the key coordinates are the 
left most boundary (C1) and right most boundary 
(C2) (Fig. 3(c)). With the C1 and C2 value, coupled 
with the known information of how many plots 


there are on the images, individual plot's coordi- 
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nates can be automatically created. 

For the binary image (Fig. 3(c)), for each col- 
umn, the sum of pixel values is calculated, which is 
actually the number of white pixels (white pixel is 
equal to 1; black pixel is equal to 0). The relation 
between the image x coordinates (horizontal direc- 
tion) and their corresponding sum of the individual 
column is shown in Fig. 3(d). Due to a sudden 
change from 0 to a very high value, the C1 value 
could be obtained, and the C2 value is actually the 
right edge of the image. Similarly, the relationship 
between the row coordinate (vertical direction) and 
each row's sum is presented in Fig. 3(e), with R1-6 
values calculated automatically. After detecting 
each plot's coordinates, these coordinates were used 
to crop the original images automatically to gener- 
ate the individual plot dataset. Sample images auto- 


matically generated are shown in Fig. 3(e). 
2.3 Machine learning algorithm 


While applying ML models, it requires to first 
extract and then feed features into the model for 
training and testing. In this study, color and textural 
features were extracted. A detailed description of 
those features is shown in Table 1. Since our group 
has published a paper with all features' information 
(e.g., formula and specifical meanings) used in this 
study explained, all the detailed information can be 
found in [19]. The color features were chosen as 
lodged crop plots would have soil and stems shown 
in the images, which may be less green than the nor- 
mal crop growing area. As for the texture features, 
lodged crops may be not as uniform as non-lodged 
crops. A number of classifiers have been used to ad- 
dress classification issues in agriculture, such as K 
nearest neighbors (KNN), random forest (RF), and 
support vector machine (SVM)"'*"!, and since there 


is no clear information about which classifier could 


lead to the highest detection accuracy, this study 
tested all the three commonly used methods (KNN, 
RF, and SVM) that have good performance validat- 
ed in previous research. While applying KNN, a 
crucial parameter affecting the performance of 
KNN is the selection of K value—small value 
would reduce the accuracy, while a large value 
would increase the computational cost. Preliminary 
tests on different K values (5 to 50) were conduct- 
ed, and K = 20 was selected. While taking advan- 
tage of SVM, the most performance affecting pa- 
rameter is the kernel function. There are many ker- 
nel functions can be used, including sigmoid, poly- 
nomial, and radial basis function (RBF). In this 
study, RBF was selected for its robust perfor- 
mance". Regarding RF, out-of-bag error (OOBE) 
was used to evaluate the model's prediction accura- 
cy. Preliminary results showed that OOBE de- 
creased when the tree number was larger than 100. 
Thus, 100 trees were used in the RF model. 
Table 1 Features selected based on domain knowledge 


for wheat lodging ratio detection 


Feature number Feature Description 
1 R Red channel 
2 G Green channel 
3 B Blue Channel 
4 H Hue Channel 
5 S Saturation channel 
6 I Intensity channel 
7 L Luminosity channel 
8 a a channel 
9 b b channel 
10 NDI Normalized difference index 
11 ExG Excess green index 
12 ExR Excess red index 
13 GLCM Texture feature 


Note: Features 1—9 are all normalized; detailed information of 


these features can be referred to Flores et al. '!! 


2.4 Deep learning 


Compared to the complex procedures of using 


ML model to address classification issues, DL mod- 
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els are recently used by researchers. Compared to 
the ML, there is no need to perform the manual fea- 
ture extraction and selection procedure for the DL, 
which is domain knowledge-based and challenging 
for some researchers. There are a number of DL 
models that have been used, such as GoogLeNet, 
VGG16, ResNet101, to address classification prob- 
lems in agriculture”, In this study, since it is un- 
known which DL model has the most satisfactory 
performance, the commonly used and performance 
validated three models (i. e., ResNetl01, VGG16, 
and GoogLeNet) were tested, following by compar- 
ing their performances. The VGG16 network stacks 
layers on top of each other to increase the layer 
number of the network, and "16" stands for the 
number of weight layers. To avoid the vanishing 
gradient and overfitting problems with the in- 
creased number of layers, a residual module was 
added to the ResNet101 model, which makes it pos- 
sible to train deep neural networks. The innovation 
of GoogLeNet relies on the concept of inception 
module, which arranges the convolution and pool- 
ing layers in a parallel manner. Thus, features ex- 
tracted by GoogLeNet are more comprehensive—in- 
cluding both general and local features!” 

While applying DL model, it is desirable to 
have more data for model training and testing. 
Thus, researchers always conduct dataset augmenta- 
tion procedure before running DL models. Dataset 
augmentation is a procedure to manually increase 
the size of existing datasets. This study conducted 
geometric transformation for dataset augmentation, 
including shear, translation, rotation, scaling, and re- 
flection. For individual image, one of the five geo- 
metric transformation approaches was randomly se- 
lected and applied. For each height, the original da- 
taset consisted of 183, 144 and 73 images for non-, 


light-, and severe-lodging, respectively; after the im- 


age augmentation procedure, the dataset consisted 
of 549, 432 and 219 images for non-, light-, and se- 
vere-lodging, respectively. 

All data processing was based on Matlab” 
2020b (The Mathworks, Inc., Natick, Mass., USA), 
including feature extraction and selection, and run- 
ning ML and DL models. The desktop used for this 
study was configured with Windows 10 OS, Intel 
(R) Core(TM) i7-8700 CPU, 32 GB RAM, Intel(R) 
UHD Graphics 630, and 16 GB GPU memory. For 
all the modeling processes, 70% of the dataset was 
randomly selected to be used for training, and the 


other 30% for testing. 


3 Results and discussion 


3.1 Machine learning classification results 


For the ML approach, the SVM, RF and KNN 
classifiers were used and the results of the three 
classifiers’ performance on different heights are 
shown in Fig. 4. Each model was run for 10 times, 
with the averaged detection accuracy used to repre- 
sent the model's performance. For all the three dif- 
ferent heights, among the three models, RF consis- 
tently results in higher accuracy over the other two 
models. For the three different heights, the accuracy 
of RF is at least 4% higher than either of the other 
two models (SVM and KNN). Therefore, among 
the three models, RF should be selected to serve the 
purpose of lodging ratio detection due to its high ac- 
curacy. 

Detailed information of the RF model perfor- 
mance is shown in Fig. 5. The accuracies of 15, 46 
and 91 m altitude are 71.5%, 70.8% and 71.4%, 
which are not significantly different from each oth- 
er (same letter in Fig. 5). The image pixel resolu- 
tions of the 15, 46, and 91 m were 0.4, 1.2 and 2.5 


cm/pixel, respectively, indicating that images with 
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Note: Bars for each group with different letters are significantly dif- 
ferent by Tukey test (a = 0.05) 


Fig. 4 Detection accuracy of different wheat lodging ratios 
based on color and textural features extracted from un- 
manned aerial systems images collected at three flight alti- 
tudes coupled with SVM , random forest (RF and KNN 


classifiers ) 


2.5 cm/pixel resolution performing similarly to im- 
ages with 1.2 and 0.4 cm/pixel. The reason for this 
observation is perhaps both the analysis region of 
interest and the plot scale (1.5 m x 3.6 m and 1.5 m x 
14.6 m) are large. Since higher resolution images re- 
quires more time to be collected, more storage 
space, and more computer power to be processed 
than lower resolution ones, and based on the results 
found on this study, it recommends that future stud- 
ies on wheat lodging detection could be carried out 


using imagery with 2.5 cm/pixel resolution. 
3.2 Deep learning results 


Wheat lodging detection accuracies for three 
different ML models (ResNet101, GoogLeNet and 
VGG16) are shown in Fig. 6. Since the DL model 
ran many iterations, and it uses randomly selected 
data (70% training and the other 30% testing) in 
each iteration, there is no need to run the models 
multiple times. Among the three models, in each 
flight height, ResNet101 and VGG16 consistently 
result in the highest and lowest detection accuracy, 


respectively. The reason for the ResNet101 to lead 


fon) a a ~I 
© aA So n 


Detection accuracy, % 


Nn 
n 


50 
15 m 46 m 91m 


Note: Bars for each group with the same letters are not significantly 
different by Tukey test (a = 0.05) 


Fig. 5 Detection accuracy of different wheat lodging ratio 
based on color and textural features extracted from un- 
manned aerial systems images collected at three flight alti- 


tudes coupled with RF classifier 


to the highest accuracy is probably deep layers 
(more convolutional layers). The more layers the 
model has, the finer features the model could ex- 
tract. The finer features would represent the image's 
characters more accurately, and thus lead to higher 
accuracy. The GoogLeNet has an inception module, 
arranging the convolutional and pooling layers in 
parallel to extract features using different size of fil- 
ters (i.e., 1x1, 3x3, 5x5). By applying different size 
of filters, the inception model could extract both 
general and local features. Since VGG16 could only 
extract features using one size of filter, it explained 
the GoogLeNet generating a higher accuracy over 
VGG16. Considering the superior performance of 
ResNetl101 over GoogLeNet and VGGI16 on all 
three flight heights, ResNet101 is recommended to 
be used for wheat lodging ratio detection. For the 
ResNetl01, the detection accuracies are 75.2%, 
75.1%, and 75.3% for the three flight heights of 
15 m (image resolution 0.4 cm/pixel), 46 m (image 
resolution 1.2 cm/pixel) and 91 m (image resolution 
2.5 cm/pixel), respectively. Generally, similar to RF, 
the UAV flight height does not affect the detection 
accuracy significantly by using the ResNe101. 


32 


ChinaX iva (ERAT! 


202302.00212v1 


chinaXiv 


a oo 
© =] 


Detection accuracy/ % 
eN 
© 


50 
40 
30 
15 46 91 
Flight hight/m 
E ResNet101 OGoogLeNet BVGG16 


Fig. 6 Detection accuracy of wheat lodging ratio based on col- 
or and textural features extracted from unmanned aerial sys- 
tems images collected at three flight altitudes coupled with 


ResNet100, GoogLeNet and VGG16 deep learning models 


3.3 Comparison of ML and DL 


The detection accuracies comparison between 
ML (RF) and DL (ResNet101) models are shown in 
Table 2. For all the three mission heights, 
ResNet101 has a higher accuracy over RF consis- 
tently. A potential reason is that the manually ex- 
tracted features are limited, and there are some oth- 
er meaningful features that could represent the im- 
age properties are missed, or have not been identi- 
fied/incorporated by researchers. The miss of these 
meaningful features lowers the model accuracy. 
However, the ResNet101 model takes advantage of 
convolutional neural networks to extract deep fea- 
tures, which could extract very fine deep features. 
Thus, since the deep features could represent the im- 
age's fine characteristics, it would represent the im- 
ages more properly and comprehensively over the 
manually extracted features. 

After comparing the performance of RF and 
ResNet101, the ResNet101 is recommended to be 
used for wheat lodging detection. Since different 
flight heights do not affect the detection accuracy, it 
is recommended to use the 91 m (300 ft.) images 
over the 15 m (50 ft.) images for wheat lodging de- 


tection due to the high data collection speed and 


small size of dataset. 


Table 2 Detection accuracy comparison between RF and 


ResNet101 at different flight heights 


Unit: % 
Flight hight 
Model 
15m (50 ft.) 46m(150ft.) 91m (300 ft.) 
RF 71.5 70.8 71.4 
ResNet101 75.2 TSA 75.3 


4 Conclusion 


This study has tested the performance of the 
machine learning models (SVM, RF and KNN) and 
deep learning model (ResNet101, GoogLeNet, and 
VGG16) on wheat lodging ratio detection at three 
different heights. A majority of studies related to im- 
age classification have used a manual approach to 
generate individual plot images from field trials, 
which is time-consuming, inefficient, and inaccu- 
rate. A new approach for automatic field plot datas- 
et generation, which based on image pre-processing 
using excess green index was developed. This new 
and automatic dataset generation approach was ap- 
plied and validated, and could be applied to a simi- 
lar workflow by other researchers. For both RF and 
ResNet101, the detection accuracies were not signif- 
icantly different for the three flight altitudes (15, 46 
and 91 m) tested, which had image resolutions of 
0.4, 1.2 and 2.5 cm/pixel, respectively. Since flying 
at high altitude did not affect the models' accuracy 
and increase efficiencies related to data collection 
and processing, itis recommended to use 2.5 cm/pixel 
resolution, instead of the 0.4 or 1.2 cm/pixel resolu- 
tions, for wheat lodging ratio detection. Regarding 
the SVM and ResNet101, the ResNet101 is recom- 
mended for wheat lodging ratio detection because 
of its higher accuracy and simple application proce- 
dure (free of manual feature selection). This study 


has demonstrated that using low-resolution images 
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(2.5 cm/pixel) coupled with ResNet101 is an effi- 
cient approach for wheat lodging ratio detection, 
with an accuracy of 75%. 

There are a number of parameters that can be 
used to compare the performance of models, such 
as model size, computation cost, and accuracy. 
Since this study was insensitive to model size and 
computation cost, and researchers were more con- 
cerned on the detection accuracy, authors chose the 
accuracy metric to compare the model performance. 
Further research can be conducted to provide a com- 


prehensive comparison between different models. 
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GEPEM IM SAE AL AE REL AEA EGA BE IIE RT 58102 , RE) 


HO Bs NATE WEP AA AR a Se, AE SS EE a oP ARAR A R BB 
WEL. PESRNI TIER Te AE OER, MARRIR, Sa He TINT EL AE SC SEAS EB. EP 


— TALL, RRE S BPE ABU BE HE EE TIE. AERIAN, 46 #91 m = he 
RAR AB AGE; RAR TERS. WETER, AIK TEAR <50% MAR I FA>S0% FY ba HEE RT A: — BH ve 
KE OLHEAT A TE TA RAS POLAR CSC RRL BEL. BEL AR AK KGS) P= BPR BEA A 
(ResNet101, GoogLeNet, VGG16) REXIN BREM OLUETT TR, MRR, ResNetlO1 MAE 24 RE 
PEHLA, FF ALTE 91 m Tey REAR SE AY IHR OP SS A EAMT TE 15 m BEER EY BE. AIEN T EFATE 
91 mm ERE IC APLAR, RH ResNetl01 XNE (8 1h REM EP RAT TR, BE 
Wt BIA BIT 75% 

Kei: DEAK; HD; PRES; BREE; ANL; ResNetl01 
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